Karaoke processing method, apparatus and system

ABSTRACT

A karaoke processing method, a karaoke processing apparatus and a karaoke processing system are provided. The method includes: acquiring an accompaniment audio file, an original vocal audio file and a lyrics file of a song, where the lyrics file includes time information for identifying display time of lyrics; dividing the lyrics file to obtain a first lyrics part and a second lyrics part and generating lyrics division information; generating a target audio file by outputting the accompaniment audio file in the display time of the first lyrics part and outputting the original vocal audio file in the display time of the second lyrics part; and sending the target audio file and a target lyrics file to a client upon reception of a karaoke request sent by the client, where the target lyrics file is obtained by adding the lyrics division information to the lyrics file.

The present application is the national phase of International Patent Application No. PCT/CN2016/081650, filed on May 11, 2016, which claims priority to Chinese Patent Application No. 201510280087.7, titled “KARAOKE PROCESSING METHOD, APPARATUS AND SYSTEM”, filed on May 27, 2015 with the Chinese State Intellectual Property Office, both of which are incorporated herein by reference in their entireties.

TECHNICAL FIELD

The present disclosure relates to the field of audio processing, and in particular to a karaoke processing method and apparatus and a karaoke processing system.

BACKGROUND

With the improvement of people's quality of life, karaoke applications are used more and more in daily life and offer increasingly rich functions.

A karaoke application is a combination of a music player and recording software, which can play an original vocal, record the singing of the user, and mix the recorded singing with accompaniment to obtain a music file. The user can further upload the music file to the Internet, so that more people can hear his/her singing.

Typically, when the user records his/her singing, the karaoke application provides an accompaniment audio file to serve as accompaniment for the recording. However, in this way, the user can only sing alone and cannot have the karaoke experience of singing with a star.

SUMMARY

A karaoke processing method, a karaoke processing apparatus and a karaoke processing system are provided according to the embodiments of the present disclosure, to realize the karaoke experience of singing with a star.

A karaoke processing method is provided according to a first aspect of the embodiments of the present disclosure, which includes:

acquiring, by a server, an accompaniment audio file, an original vocal audio file and a lyrics file of a song, where the lyrics file includes time information for identifying display time of lyrics;

dividing, by the server, the lyrics file to obtain a first lyrics part and a second lyrics part and generating lyrics division information, where the lyrics division information includes identification information for identifying the first lyrics part;

generating, by the server, a target audio file by outputting the accompaniment audio file in the display time of the first lyrics part and outputting the original vocal audio file in the display time of the second lyrics part; and

sending, by the server, the target audio file and a target lyrics file to a client upon reception of a karaoke request sent by the client, where the target lyrics file is obtained by adding the lyrics division information to the lyrics file.

A karaoke processing apparatus is provided according to a second aspect of the embodiments of the present disclosure, which includes:

an acquisition unit configured to acquire an accompaniment audio file, an original vocal audio file and a lyrics file of a song, where the lyrics file includes time information for identifying display time of lyrics;

a processing unit configured to divide the lyrics file to obtain a first lyrics part and a second lyrics part, and generate lyrics division information, where the lyrics division information includes identification information for identifying the first lyrics part;

a generation unit configured to generate a target audio file by outputting the accompaniment audio file in the display time of the first lyrics part and outputting the original vocal audio file in the display time corresponding to the second lyrics part; and

a sending unit configured to send the target audio file and a target lyrics file to a client upon reception of a karaoke request sent by the client, where the target lyrics file is obtained by adding the lyrics division information to the lyrics file.

A karaoke processing system is provided according to a third aspect of the embodiments of the present disclosure, which includes a server and a client.

The server includes the karaoke processing device according to the second aspect.

The client is configured to provide karaoke service to a user by using the target audio file as accompaniment audio and using the target lyrics file as lyrics to be played.

In the technical solution according to the embodiment of the present disclosure, the server first acquires an accompaniment audio file, an original vocal audio file and a lyrics file of a song, where the lyrics file includes time information for identifying a display time of lyrics. Then the lyrics file is divided to obtain the first lyrics part and the second lyrics part, and lyrics division information is generated, where the lyrics division information includes identification information for identifying the first lyrics part. Then a target audio file is generated by outputting the accompaniment audio file in the display time of the first lyrics part and outputting the original vocal audio file in the display time corresponding to the second lyrics part. Accordingly, upon reception of the karaoke request sent by a client, the server sends the target audio file and the lyrics division information to the client so that the client provides karaoke service to the user by using the target audio file as the accompaniment audio and based on the above-described lyrics division information. In the embodiment of the present disclosure, the target audio file sent by the server to the client includes the accompaniment in the display time of the first lyrics part, and the original vocal in the display time of the second lyrics part. Therefore, in the embodiment of the present disclosure, it can be realized that the song is sung by the user in the display time of the first lyrics part, and originally sung by the star in the display time of the second lyrics part, thereby providing the karaoke experience of singing with the star.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a schematic diagram of a karaoke processing method according to an embodiment of the present disclosure;

FIG. 2 is a schematic diagram of a karaoke processing method according to another embodiment of the present disclosure;

FIG. 3 is a schematic diagram of a karaoke processing device according to an embodiment of the present disclosure;

FIG. 4 is a schematic diagram of a karaoke processing system according to an embodiment of the present disclosure; and

FIG. 5 is a schematic structural diagram of a server to which a karaoke processing method is applied according to an embodiment of the present disclosure.

DETAILED DESCRIPTION OF THE EMBODIMENTS

A karaoke processing method and apparatus and a karaoke processing system are provided according to embodiments of the present disclosure, to realize the karaoke experience of singing with a star, which are described in detail respectively as follows.

The technical solutions according to the embodiments of the present disclosure will be described clearly and completely as follows in conjunction with the drawings in the embodiments of the present disclosure. It is apparent that the described embodiments are only a part of the embodiments according to the present disclosure, rather than all the embodiments. Any other embodiments obtained by those skilled in the art based on the embodiments in the present disclosure without any creative efforts fall within the scope of protection of the present disclosure.

The terms “first”, “second”, “third”, “fourth” and the like (if present) in the specification and claims of the present disclosure and the above-described drawings are intended to distinguish between similar objects and not necessarily to describe a specific order or sequence. It is to be understood that the data thus used is interchangeable where appropriate so that the embodiments described herein can be implemented in an order other than the content illustrated or described herein. In addition, the terms “include” and “have” and any variations thereof are intended to cover non-exclusive inclusion, for example, processes, methods, systems, products or devices that include a series of steps or units are not necessarily limited to those steps or units clearly listed, but may include other steps or units that are not clearly listed or inherent to these processes, methods, products, or devices.

It should be noted that the client in the embodiment of the present disclosure may be a mobile phone, a personal computer (PC) and/or a tablet computer. In practical implementation, one can log into a web page via the client, load a program on the client, or use applications (Apps) released on various mobile platforms to access the karaoke service.

In the embodiment of the present disclosure, the server is connected to the client via the network. The server may be set up by a service provider (for example, a karaoke service provider) to manage and operate the provided services, and may comprise various servers.

Referring to FIG. 1, an embodiment of a karaoke processing method in the embodiment of the present disclosure includes the following steps 101 to 104.

In step 101, the server acquires an accompaniment audio file, an original vocal audio file and a lyrics file of a song.

In the embodiment, both the accompaniment audio file and the original vocal audio file are raw audio data. In practical application, in a case that the obtained accompaniment audio file or original vocal audio file is audio data in a compressed format, such as the mp3 format, the server decompresses the audio data in the compressed format to obtain the raw audio data.

It should be noted that, in the embodiment, the lyrics file includes time information for identifying the display time of lyrics so that the lyrics file is synchronized with the accompaniment audio file and the original vocal audio file based on the time information. For example, the lyrics file may include an LRC lyrics file that contains time tags for identifying the display time of lyrics.
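As a minimal sketch of how such a time tag might be read, the following C function converts a tag at the start of a lyrics line into milliseconds. It assumes the commonly used [mm:ss.xx] tag form with two-digit hundredths of a second; the function name lrc_tag_to_ms is hypothetical and is not taken from the present disclosure.

```c
#include <stdio.h>

/* Parses an LRC-style time tag such as "[01:23.45]" at the start of `line`
 * and returns the display time in milliseconds, or -1 if the line does not
 * begin with such a tag.
 * Assumption: the tag uses the [mm:ss.xx] form, xx being hundredths of a second. */
long lrc_tag_to_ms(const char *line)
{
    int minutes, seconds, hundredths;

    if (sscanf(line, "[%d:%d.%d]", &minutes, &seconds, &hundredths) != 3)
        return -1;
    if (minutes < 0 || seconds < 0 || seconds >= 60 ||
        hundredths < 0 || hundredths > 99)
        return -1;
    return (long)minutes * 60000L + (long)seconds * 1000L + (long)hundredths * 10L;
}
```

Lines that carry metadata rather than a time tag (for example a title tag) are rejected with -1, so a caller can simply skip them.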

In step 102, the server divides the lyrics file to obtain a first lyrics part and a second lyrics part and generates lyrics division information.

In the embodiment, the server may divide the lyrics file by various methods to obtain the first lyrics part and the second lyrics part, which are illustrated respectively as follows.

In a first method, the server determines paragraph information of the lyrics file based on the time information, then divides the lyrics file based on the paragraph information to obtain the first lyrics part and the second lyrics part.

In the embodiment, the time information of the lyrics file may reflect a starting point and an end point of the display time corresponding to each paragraph of the song. Therefore, the server may determine the number of paragraphs included by the lyrics file and the starting point and the end point of the display time corresponding to each paragraph based on the time information. When dividing the lyrics file based on the above-described paragraph information, the server may take at least one preset paragraph as the first lyrics part and the other part as the second lyrics part. The number of paragraphs of the first lyrics part and the specific paragraph(s) in the first lyrics part are not limited herein. For example, a structure of the paragraph information may be represented as:

typedef struct section_info {
    int section_id; // paragraph id
    int start_ms;   // starting time, ms
    int end_ms;     // ending time, ms
    int role_id;    // role id
} Ssection, *PSection;

In the structure, the role id indicates whether the section corresponds to a female vocal part or a male vocal part. Preferably, the paragraphs corresponding to the female vocal part may be set as the first lyrics part or as the second lyrics part, with the paragraphs corresponding to the male vocal part forming the other part, so that the lyrics are divided based on the male and female roles to obtain the first lyrics part and the second lyrics part of the different roles.
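A role-based division over the structure above could be sketched as follows. The role codes ROLE_FEMALE and ROLE_MALE, the function divide_by_role and the flag array are assumptions made for illustration only; the disclosure does not define concrete role id values or a function interface.

```c
#include <stddef.h>

typedef struct section_info {
    int section_id; /* paragraph id */
    int start_ms;   /* starting time, ms */
    int end_ms;     /* ending time, ms */
    int role_id;    /* role id */
} Ssection;

/* Hypothetical role codes; the source does not specify actual values. */
enum { ROLE_FEMALE = 0, ROLE_MALE = 1 };

/* Marks every section whose role matches `user_role_id` as belonging to the
 * first lyrics part (flag 1); all other sections form the second lyrics part
 * (flag 0). Returns the number of sections placed in the first lyrics part. */
size_t divide_by_role(const Ssection *sections, size_t count,
                      int user_role_id, unsigned char *in_first_part)
{
    size_t first_count = 0;

    for (size_t i = 0; i < count; i++) {
        in_first_part[i] = (sections[i].role_id == user_role_id) ? 1 : 0;
        first_count += in_first_part[i];
    }
    return first_count;
}
```

The resulting flag array is one possible carrier for the identification information of the first lyrics part described later in this embodiment.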

In a second method, the server determines a verse part and a refrain part of the lyrics file, and determines the verse part as the first lyrics part and the refrain part as the second lyrics part, or determines the refrain part as the first lyrics part and the verse part as the second lyrics part.

In the present embodiment, the server may divide the lyrics based on the difference between the refrain and the verse so as to obtain the first lyrics part corresponding to the verse and the second lyrics part corresponding to the refrain, or obtain the first lyrics part corresponding to the refrain and the second lyrics part corresponding to the verse.

In a third method, the server divides the lyrics file based on a custom template, to obtain the first lyrics part and the second lyrics part.

In the present embodiment, the server may customize in advance multiple templates for dividing the lyrics, and then divide the lyrics file based on the above-described custom templates to obtain the first lyrics part and the second lyrics part. For example, a fixed N sentences of lyrics or N words are set as the first lyrics part. The specific method for setting the custom template is not limited here, and those skilled in the art can set a variety of different templates according to the actual situation.
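Two simple templates can be sketched as follows. TEMPLATE_FIRST_N corresponds to the example of setting a fixed N sentences as the first lyrics part; TEMPLATE_ALTERNATE_N, which alternates groups of N sentences between the two parts, is a purely illustrative variant that is not described in the disclosure. All type and function names are hypothetical.

```c
#include <stddef.h>

/* Hypothetical template kinds, introduced only for illustration. */
typedef enum { TEMPLATE_FIRST_N, TEMPLATE_ALTERNATE_N } TemplateKind;

typedef struct {
    TemplateKind kind;
    size_t n; /* number of lyric sentences per group; expected to be > 0 */
} DivisionTemplate;

/* Returns 1 if the sentence with 0-based index `line_index` belongs to the
 * first lyrics part under template `t`, and 2 if it belongs to the second. */
int template_part(const DivisionTemplate *t, size_t line_index)
{
    if (t->n == 0)
        return 2; /* degenerate template: nothing assigned to the first part */

    switch (t->kind) {
    case TEMPLATE_FIRST_N:
        /* the first n sentences form the first lyrics part */
        return line_index < t->n ? 1 : 2;
    case TEMPLATE_ALTERNATE_N:
        /* groups of n sentences alternate: first part, second part, ... */
        return (line_index / t->n) % 2 == 0 ? 1 : 2;
    }
    return 2;
}
```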

It is to be noted that, in the present embodiment, the above are only a few examples of how to divide the lyrics file to obtain the first lyrics part and the second lyrics part. In practice, the server may combine the above-described division methods, or may use other division methods, which is not limited here.

Optionally, in the present embodiment, before performing step 102, the server may receive a division instruction sent by the client, which indicates a division method specified by the user. Then the server divides the lyrics file based on the division instruction to obtain the first lyrics part and the second lyrics part, thereby allowing the user to choose how to divide the lyrics, and obtaining the first lyrics part and the second lyrics part as desired by the user.

In the present embodiment, after the first lyrics part and the second lyrics part are obtained by the division processing, the server further records the division result to generate lyrics division information, where the lyrics division information includes identification information for identifying the first lyrics part. Here, since the lyrics consist of two parts in total, the server, the client or the like can identify the corresponding second lyrics part with the identification information identifying the first lyrics part. Of course, in other embodiments, the lyrics division information may further include identification information for identifying the second lyrics part.

In step 103, the server generates a target audio file by outputting the accompaniment audio file in the display time of the first lyrics part and outputting the original vocal audio file in the display time of the second lyrics part.

In the present embodiment, during the process of generating the target audio file, the accompaniment audio file is outputted in the display time corresponding to the first lyrics part to acquire the audio data of the accompaniment audio file in the corresponding time period. The original vocal audio file is outputted in the display time corresponding to the second lyrics part to acquire the audio data of the original vocal audio file in the corresponding time period. Then the audio data at each time point are synthesized to an audio file in accordance with the time sequence. The synthesized audio file is the target audio file. It is to be emphasized that, during the process of generating the target audio file described above, which audio data of the accompaniment audio file and the original vocal audio file is to be outputted in which time period is determined based on the display time of the lyrics. Thereby, when the target audio file is played, the accompaniment is outputted in the display time corresponding to the first lyrics part, and the original vocal is outputted in the display time corresponding to the second lyrics part. That is, the target audio file can be automatically switched between the accompaniment and the original vocal.
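The splicing described above can be sketched as follows, assuming that both files have already been decoded to time-aligned mono 16-bit PCM buffers of equal length and the same sample rate. The TimeRange type and the function names are hypothetical. The sketch also assumes that time outside both lyrics parts (for example an intro or interlude) is taken from the accompaniment, which the disclosure does not specify.

```c
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Display time of one paragraph of the second lyrics part (hypothetical type). */
typedef struct {
    int start_ms;
    int end_ms;
} TimeRange;

/* Converts a time in milliseconds to a sample index, clamped to [0, total]. */
static size_t ms_to_sample(int ms, int sample_rate, size_t total)
{
    if (ms <= 0)
        return 0;
    size_t idx = (size_t)(((int64_t)ms * sample_rate) / 1000);
    return idx > total ? total : idx;
}

/* Builds the target audio: accompaniment everywhere, except that the original
 * vocal is output within each display time range of the second lyrics part. */
void build_target_audio(const int16_t *accomp, const int16_t *vocal,
                        size_t n_samples, int sample_rate,
                        const TimeRange *second_part, size_t n_ranges,
                        int16_t *target)
{
    /* Start from the accompaniment (assumed default outside the lyrics). */
    memcpy(target, accomp, n_samples * sizeof *target);

    for (size_t i = 0; i < n_ranges; i++) {
        size_t begin = ms_to_sample(second_part[i].start_ms, sample_rate, n_samples);
        size_t end = ms_to_sample(second_part[i].end_ms, sample_rate, n_samples);
        if (end > begin)
            memcpy(target + begin, vocal + begin, (end - begin) * sizeof *target);
    }
}
```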

Optionally, in the present embodiment, the method may further include: performing, by the server, balance processing on the audio data of the target audio file at the switching point. Specifically, the switching point refers to the time point for switching from the display time corresponding to the first lyrics part to the display time corresponding to the second lyrics part or the time point for switching from the display time corresponding to the second lyrics part to the display time corresponding to the first lyrics part. Here, since the target audio file is obtained by splicing two audio files, the audio quality of the target audio file can be ensured by performing the balance processing at the splicing point. In the present embodiment, the conventional audio splicing technology may be adopted as the balance processing method, such as average filtering, superposition processing, and the like.
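One possible form of such balance processing is a short linear crossfade, in which the outgoing and incoming audio data are superposed with complementary weights around the switching point. The sketch below is an assumption about how superposition processing might be realized, not the method prescribed by the disclosure; the function name and parameters are hypothetical, and the two source buffers are assumed to be time-aligned mono 16-bit PCM.

```c
#include <stddef.h>
#include <stdint.h>

/* Overwrites a window of `fade_len` samples centred on `switch_idx` in
 * `target` with a linear crossfade from `from` (the track played before the
 * switching point) to `to` (the track played after it). */
void crossfade_at_switch(const int16_t *from, const int16_t *to,
                         size_t n_samples, size_t switch_idx,
                         size_t fade_len, int16_t *target)
{
    size_t half = fade_len / 2;
    size_t begin = switch_idx > half ? switch_idx - half : 0;

    if (begin >= n_samples)
        return;

    size_t end = begin + fade_len;
    if (end > n_samples)
        end = n_samples;

    size_t span = end - begin;
    int64_t den = (int64_t)span + 1;

    for (size_t i = 0; i < span; i++) {
        /* weight of the incoming track rises linearly across the window */
        int64_t w_in = (int64_t)i + 1;
        int64_t mixed = ((int64_t)from[begin + i] * (den - w_in) +
                         (int64_t)to[begin + i] * w_in) / den;
        target[begin + i] = (int16_t)mixed;
    }
}
```

Because the two weights always sum to one, the result stays within the 16-bit range without additional clipping.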

In step 104, upon reception of the karaoke request sent by a client, the server sends the target audio file and a target lyrics file to the client, where the target lyrics file is obtained by adding the lyrics division information to the lyrics file.

In the present embodiment, upon reception of the karaoke request sent by the client, the server sends the target audio file and the target lyrics file to the client so that the client provides karaoke service to the user by using the target audio file as the accompaniment audio and using the target lyrics file as the lyrics to be played, where the target lyrics file is obtained by adding the lyrics division information to the above-described lyrics file. In the present embodiment, when the client plays the target audio file, the accompaniment is outputted in the display time of the first lyrics part, and the original vocal is outputted in the display time of the second lyrics part. Thereby, the song is sung by the user in the display time of the first lyrics part, and is originally sung by a star in the display time of the second lyrics part, realizing the karaoke experience of singing with the star. It is to be understood that in this embodiment, the target audio file and the target lyrics file are generated only by the server, and then sent to the client, thereby providing the karaoke experience of singing with the star without changing the client logic.

In the technical solution according to the embodiment of the present disclosure, the server first acquires the accompaniment audio file, the original vocal audio file and the lyrics file of the song, where the lyrics file includes time information for identifying the display time of lyrics. Then the lyrics file is divided to obtain the first lyrics part and the second lyrics part, and the lyrics division information is generated, where the lyrics division information includes identification information for identifying the first lyrics part. Then a target audio file is generated by outputting the accompaniment audio file in the display time corresponding to the first lyrics part and outputting the original vocal audio file in the display time corresponding to the second lyrics part. Thereby, upon reception of the karaoke request sent by the client, the server sends the target audio file and the lyrics division information to the client so that the client provides the karaoke service to the user by using the target audio file as the accompaniment audio and based on the above-described lyrics division information. In the embodiment of the present disclosure, the target audio file sent by the server to the client includes the accompaniment in the display time of the first lyrics part, and the original vocal in the display time of the second lyrics part so that it can be realized that the song is sung by the user in the display time of the first lyrics part, and is originally sung by the star in the display time of the second lyrics part, thereby providing the karaoke experience of singing with the star.

Next, based on the embodiment shown in FIG. 1, it is further described how the client provides the karaoke service to the user by using the target audio file as the accompaniment audio and using the target lyrics file as the lyrics to be played. Referring to FIG. 2, a karaoke processing method according to another embodiment of the present disclosure includes the following steps 201 to 207.

In step 201, the server acquires the accompaniment audio file, the original vocal audio file and the lyrics file of the song.

In step 202, the server divides the lyrics file to obtain the first lyrics part and the second lyrics part.

In step 203, the server generates a target audio file by outputting the accompaniment audio file in the display time of the first lyrics part and outputting the original vocal audio file in the display time of the second lyrics part.

In step 204, upon reception of the karaoke request sent by a client, the server sends the target audio file and a target lyrics file to the client, where the target lyrics file is obtained by adding the lyrics division information to the lyrics file.

In the present embodiment, the processes of steps 201 to 204 are the same as those of steps 101 to 104 in the embodiment shown in FIG. 1, and will not be described again here.

In step 205, the client plays the target audio file and the target lyrics file.

In the present embodiment, after the target audio file and the target lyrics file sent by the server are received by the client, the karaoke application on the client plays the above-described target audio file and the target lyrics file, where the target audio file is played as the accompaniment audio.

In order to help a user to quickly identify which part of the lyrics is for him/her to sing and which part of the lyrics is to be sung by the star, the client may further mark the lyrics displayed on the client. Preferably, in the embodiment, the client may further provide, based on the lyrics division information, a first identifier for the first lyrics part displayed on the client, and a second identifier for the second lyrics part displayed on the client. It is to be understood that the above-described identifier may be a color identifier, for example, the first lyrics part is displayed in one color and the second lyrics part is displayed in another color. Alternatively, the above-described identifier is a highlighted state identifier, for example, the first lyrics part is displayed in the highlighted state, and the second lyrics part is displayed in a non-highlighted state. The specific form of the identifier is not limited herein, and other forms of identification may be adopted by those skilled in the art as required.

In step 206, the client records, based on the lyrics division information, the audio data of the user in the display time corresponding to the first lyrics part.

In the present embodiment, the client may identify the display time corresponding to the first lyrics part and the display time corresponding to the second lyrics part based on the lyrics division information, record the audio data of the user in the display time corresponding to the first lyrics part, and stop recording in the display time corresponding to the second lyrics part, at which time the target audio file played by the client corresponds to the original vocal of the star. Thereby, it can be realized that the song is sung by the user in the display time of the first lyrics part, and is originally sung by the star in the display time of the second lyrics part, providing the karaoke experience of singing with the star.
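The recording decision on the client can be sketched as a lookup of the current playback position in the display time ranges of the first lyrics part. The TimeRange type and the function should_record are hypothetical names; how the client actually derives these ranges from the lyrics division information is not specified in the disclosure.

```c
#include <stddef.h>

/* Display time of one paragraph of the first lyrics part (hypothetical type). */
typedef struct {
    int start_ms;
    int end_ms;
} TimeRange;

/* Returns 1 if the playback position falls within the display time of the
 * first lyrics part (the user sings, so the client records), and 0 otherwise
 * (the original vocal plays, so recording is stopped). Each range is treated
 * as half-open: start_ms inclusive, end_ms exclusive. */
int should_record(const TimeRange *first_part, size_t n_ranges, int position_ms)
{
    for (size_t i = 0; i < n_ranges; i++) {
        if (position_ms >= first_part[i].start_ms &&
            position_ms < first_part[i].end_ms)
            return 1;
    }
    return 0;
}
```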

In step 207, the client mixes the audio data of the user and the target audio file to obtain an antiphonal singing file.

In the present embodiment, the obtained antiphonal singing file includes the audio data of the singing of the user in the display time of the first lyrics part, and the audio data of the original vocal of the star in the display time of the second lyrics part (that is, the audio data of the original vocal audio file in the corresponding time period). Optionally, in the present embodiment, the user may also upload the antiphonal singing file to the Internet via the client so as to share with others the song he/she sings with the star.

In the present embodiment, the mixing processing may be performed by using a conventional mixing algorithm, for example, adding the data at corresponding sampling points and dividing the sum by two.
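The averaging rule described above can be written as the following sketch, assuming both inputs are time-aligned 16-bit PCM buffers of equal length; the function name mix_average is hypothetical.

```c
#include <stddef.h>
#include <stdint.h>

/* Mixes the recorded user audio with the target audio by adding the data at
 * corresponding sampling points and dividing the sum by two. The sum is
 * formed in 32 bits so that it cannot overflow before the division. */
void mix_average(const int16_t *user, const int16_t *target,
                 size_t n_samples, int16_t *out)
{
    for (size_t i = 0; i < n_samples; i++) {
        int32_t sum = (int32_t)user[i] + (int32_t)target[i];
        out[i] = (int16_t)(sum / 2);
    }
}
```

Since the average of two 16-bit values always fits in 16 bits, no clipping step is needed with this rule.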

The karaoke processing method in the embodiment of the present disclosure has been described above, and the karaoke processing device in the embodiment of the present disclosure will be described below. Referring to FIG. 3, the karaoke processing device according to an embodiment of the present disclosure includes an acquisition unit 301, a processing unit 302, a generation unit 303 and a sending unit 304.

The acquisition unit 301 is configured to acquire an accompaniment audio file, an original vocal audio file and a lyrics file of a song, where the lyrics file includes time information for identifying the display time of the lyrics.

The processing unit 302 is configured to divide the lyrics file to obtain a first lyrics part and a second lyrics part and generate lyrics division information, where the lyrics division information includes identification information for identifying the first lyrics part.

The generation unit 303 is configured to generate a target audio file by outputting the accompaniment audio file in the display time of the first lyrics part and outputting the original vocal audio file in the display time of the second lyrics part.

The sending unit 304 is configured to send the target audio file and a target lyrics file to a client upon reception of a karaoke request sent by the client so that the client provides karaoke service to the user by using the target audio file as the accompaniment audio and using the target lyrics file as the lyrics to be played, where the target lyrics file is obtained by adding the lyrics division information to the lyrics file.

For better understanding, the operation flow of the karaoke processing device in the present embodiment will be described in the following with a specific application scenario as an example.

The acquisition unit 301 acquires the accompaniment audio file, the original vocal audio file and the lyrics file of the song, where the lyrics file includes time information for identifying the display time of lyrics. The processing unit 302 divides the lyrics file to obtain the first lyrics part and the second lyrics part, and generates the lyrics division information, where the lyrics division information includes identification information for identifying the first lyrics part. The generation unit 303 generates a target audio file by outputting the accompaniment audio file in the display time corresponding to the first lyrics part and outputting the original vocal audio file in the display time corresponding to the second lyrics part. Upon reception of the karaoke request sent by the client, the sending unit 304 sends the target audio file and a target lyrics file to the client so that the client provides karaoke service to the user by using the target audio file as the accompaniment audio and using the target lyrics file as the lyrics to be played, where the target lyrics file is obtained by adding the lyrics division information to the lyrics file.

In the technical solution according to the embodiment of the present disclosure, the acquisition unit 301 first acquires the accompaniment audio file, the original vocal audio file, and the lyrics file of the song, where the lyrics file includes time information for identifying the display time of the lyrics. Then the processing unit 302 divides the lyrics file to obtain a first lyrics part and a second lyrics part and generates the lyrics division information, where the lyrics division information includes identification information for identifying the first lyrics part. Then the target audio file is generated by the generation unit 303 by outputting the accompaniment audio file in the display time of the first lyrics part and outputting the original vocal audio file in the display time of the second lyrics part. Thereby, upon reception of the karaoke request sent by the client, the sending unit 304 sends the target audio file and the lyrics division information to the client so that the client provides the karaoke service to the user by using the target audio file as the accompaniment audio and based on the above-described lyrics division information. The target audio file sent to the client by the karaoke processing device in the server according to the embodiment of the present disclosure includes the accompaniment in the display time of the first lyrics part, and the original vocal in the display time of the second lyrics part. Therefore, with the embodiment of the present disclosure, it can be realized that the song is sung by the user in the display time of the first lyrics part, and is originally sung by the star in the display time of the second lyrics part, thereby providing the karaoke experience of singing with the star.

Optionally, in the present embodiment, the processing unit 302 may be configured to determine paragraph information of the lyrics file based on the time information, and divide the lyrics file based on the paragraph information to obtain a first lyrics part and a second lyrics part.

Optionally, in the present embodiment, the processing unit 302 may be configured to determine a verse part and a refrain part of the lyrics file, and determine the verse part as the first lyrics part and the refrain part as the second lyrics part, or determine the refrain part as the first lyrics part and the verse part as the second lyrics part.

Optionally, in the present embodiment, the processing unit 302 may be configured to divide the lyrics file based on a custom template to obtain a first lyrics part and a second lyrics part.

Optionally, in the present embodiment, the karaoke processing device may further include a balance processing unit 305.

The balance processing unit 305 is configured to perform balance processing on the audio data of the target audio file at a switching point, where the switching point refers to the time point for switching from the display time corresponding to the first lyrics part to the display time corresponding to the second lyrics part or the time point for switching from the display time corresponding to the second lyrics part to the display time corresponding to the first lyrics part.

The karaoke processing method and the karaoke processing device in the embodiment of the present disclosure have been described above, and the karaoke processing system in the embodiment of the present disclosure will be described below. Referring to FIG. 4, the karaoke processing system according to an embodiment of the present disclosure includes a server 401 and a client 402.

The server 401 includes the karaoke processing device as described in the embodiment shown in FIG. 3.

The client 402 is configured to provide the karaoke service to the user by using the target audio file as the accompaniment audio and using the target lyrics file as the lyrics to be played.

Optionally, in the present embodiment, the client may be configured to play the target audio file and the target lyrics file, record the audio data of the user in the display time corresponding to the first lyrics part based on the lyrics division information, and mix the audio data of the user and the target audio file to obtain an antiphonal singing file.
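A minimal sketch of the client-side recording and mixing is given below, under several assumptions: the lyrics division information has already been resolved into a list of (start_ms, end_ms) display intervals of the first lyrics part, both signals are time-aligned sequences of float samples in the range [-1.0, 1.0], and mixing is a plain sum with clipping. The names in_first_part and mix_antiphonal are hypothetical.

```python
from typing import List, Sequence, Tuple

Interval = Tuple[int, int]  # (start_ms, end_ms) of a first-lyrics-part display window


def in_first_part(t_ms: int, first_part: Sequence[Interval]) -> bool:
    """True if time t_ms falls in a display window of the first lyrics part."""
    return any(start <= t_ms < end for start, end in first_part)


def mix_antiphonal(user: Sequence[float], target: Sequence[float],
                   first_part: Sequence[Interval], sample_rate: int) -> List[float]:
    """Mix the user's voice into the target audio file.

    The user's samples are kept only inside the first-part windows (where the
    target audio carries the accompaniment); elsewhere the target audio, which
    carries the original vocal, passes through unchanged.
    """
    mixed: List[float] = []
    for i, sample in enumerate(target):
        t_ms = i * 1000 // sample_rate
        voice = user[i] if i < len(user) and in_first_part(t_ms, first_part) else 0.0
        # Clip the sum to the valid sample range to avoid overflow.
        mixed.append(max(-1.0, min(1.0, sample + voice)))
    return mixed
```

A real client would typically gate the microphone itself rather than discard samples afterwards; the sketch only shows how the division information determines where the user's voice appears in the antiphonal singing file.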

Optionally, in the present embodiment, the client may further be configured to provide, based on the lyrics division information, a first identifier for the first lyrics part displayed on the client and a second identifier for the second lyrics part displayed on the client.

It will be apparent to those skilled in the art that, for convenience and simplicity of description, the specific processes of the above-described system, device and units are not repeated here; for details, one may refer to the corresponding processes in the foregoing method embodiments.

Reference is made to FIG. 5, which shows a schematic structural diagram of a server to which the above-described karaoke processing method is applied according to an embodiment of the present disclosure. In the present embodiment, the server is implemented as a computer or a similar computing device.

As shown in FIG. 5, the computing device 500 includes one or more (only one is shown) processors 502, a memory 504 and a transmission module 506. It may be understood by those skilled in the art that the structure shown in FIG. 5 is merely illustrative and does not limit the structure of the above-described computing device. For example, the computing device 500 may also include more or fewer components than those shown in FIG. 5, or have a different configuration from that shown in FIG. 5.

The memory 504 may be used to store software programs and modules, such as program instructions/modules corresponding to the karaoke processing method in embodiments of the present disclosure. The processor 502 executes various function applications and data processing by running software programs and modules stored in the memory 504, to implement the above-described karaoke processing method. The memory 504 may include a high speed random access memory, and may include nonvolatile memory, such as one or more magnetic storage devices, flash memories, or other nonvolatile solid state memories. In some instances, the memory 504 may further include a memory remotely provided with respect to the processor 502, which may be connected to the computing device 500 via a network. Instances of such a network include, but are not limited to, the Internet, an intranet, a local area network, a mobile communication network, and combinations thereof.

The transmission module 506 is used to receive or transmit data via a network. Specific instances of the above-described network may include a wired network and a wireless network. In one instance, the transmission module 506 includes a network interface controller (NIC) that can be connected to other network devices such as a router via a network cable to communicate with the Internet. In another instance, the transmission module 506 is a radio frequency (RF) module for wirelessly communicating with the Internet.

In the embodiments according to the present application, it should be understood that the disclosed system, device and method may be implemented in other ways. For example, the device embodiments described above are only illustrative. For example, the units are divided based on a logic function thereof, and they may be divided in another way in practice. For example, multiple units or components may be combined or integrated into another system, or some features may be omitted or not performed. In addition, a displayed or discussed coupling, direct coupling or communication connection may be an indirect coupling or communication connection via some interfaces, devices or units, and may be in an electrical form, a mechanical form or another form.

The units illustrated as separate components may or may not be physically separate, and a component displayed as a unit may or may not be a physical module. That is, the components may be located at the same place, or may be distributed on multiple network units. A part or all of the units may be selected according to actual need to realize the object of the solution of the embodiment.

In addition, the functional units in the various embodiments of the present disclosure may be integrated into one processing unit, may be separate physical units, or two or more of them may be integrated into one unit. The above integrated units may be implemented by hardware or implemented as a software functional unit.

The integrated unit may also be stored in a computer readable storage medium in a case that the integrated unit is implemented as a software functional unit and is sold or used as an independent product. Based on this understanding, the technical solution of the present disclosure in essence, or the part of it that contributes over the prior art, or all or part of the technical solution, may be embodied in the form of a software product that is stored in a storage medium and includes several instructions for enabling a computer device (which may be a personal computer, a server, or a network device, etc.) to perform all or part of the steps of the method according to the various embodiments of the present disclosure. The aforementioned storage medium includes a variety of media capable of storing program code, such as a USB disk, a mobile hard disk, a read-only memory (ROM), a random access memory (RAM), a magnetic disk, or an optical disk.

As described above, the above embodiments are merely provided for describing the technical solutions of the present disclosure, but are not intended to limit the present disclosure. Although the present disclosure is described in detail with reference to the foregoing embodiments, those skilled in the art should understand that they can still modify the technical solutions described in the foregoing embodiments, or make equivalent substitutions to a part or all of the technical features; and such modifications or substitutions do not cause the essence of the corresponding technical solutions to depart from the spirit and the scope of the embodiments of the present disclosure.

The invention claimed is:
 1. A karaoke processing method, comprising: acquiring, by a server, an accompaniment audio file, an original vocal audio file and a lyrics file of a song, wherein the lyrics file comprises time information for identifying display time of lyrics; dividing, by the server, the lyrics file to obtain a first lyrics part and a second lyrics part and generating lyrics division information, wherein the lyrics division information comprises identification information for identifying the first lyrics part; generating, by the server, a target audio file by outputting the accompaniment audio file in the display time of the first lyrics part and outputting the original vocal audio file in the display time of the second lyrics part; and sending, by the server, the target audio file and a target lyrics file to a client upon reception of a karaoke request sent by the client, wherein the target lyrics file is obtained by adding the lyrics division information in the lyrics file.
 2. The karaoke processing method according to claim 1, wherein the dividing by the server the lyrics file to obtain the first lyrics part and the second lyrics part comprises: determining, by the server, paragraph information of the lyrics file based on the time information; and dividing, by the server, the lyrics file based on the paragraph information to obtain the first lyrics part and the second lyrics part.
 3. The karaoke processing method according to claim 1, wherein the dividing by the server the lyrics file to obtain the first lyrics part and the second lyrics part comprises: determining, by the server, a verse part and a refrain part of the lyrics file, and determining the verse part as the first lyrics part and the refrain part as the second lyrics part, or determining the refrain part as the first lyrics part and the verse part as the second lyrics part.
 4. The karaoke processing method according to claim 1, wherein the dividing by the server the lyrics file to obtain the first lyrics part and the second lyrics part comprises: dividing, by the server, the lyrics file based on a custom template to obtain the first lyrics part and the second lyrics part.
 5. The karaoke processing method according to claim 1, further comprising: performing, by the server, balance processing on the audio data of the target audio file at a switching point, wherein the switching point is the time point for switching from the display time corresponding to the first lyrics part to the display time corresponding to the second lyrics part, or the time point for switching from the display time corresponding to the second lyrics part to the display time corresponding to the first lyrics part.
 6. The karaoke processing method according to claim 1, further comprising: providing, by the client, karaoke service to a user by using the target audio file as accompaniment audio and using the target lyrics file as lyrics to be played.
 7. The karaoke processing method according to claim 6, wherein the providing by the client karaoke service to a user by using the target audio file as accompaniment audio and using the target lyrics file as lyrics to be played comprises: playing, by the client, the target audio file and the target lyrics file; recording, by the client, audio data of the user in the display time corresponding to the first lyrics part; and mixing, by the client, the audio data of the user and the target audio file to obtain an antiphonal singing file.
 8. The karaoke processing method according to claim 7, further comprising: providing, by the client based on the lyrics division information, a first identifier for the first lyrics part displayed on the client and a second identifier for the second lyrics part displayed on the client.
 9. A karaoke processing device, applied to a server, comprising: an acquisition unit configured to acquire an accompaniment audio file, an original vocal audio file and a lyrics file of a song, wherein the lyrics file comprises time information for identifying display time of lyrics; a processing unit configured to divide the lyrics file to obtain a first lyrics part and a second lyrics part, and generate lyrics division information, wherein the lyrics division information comprises identification information for identifying the first lyrics part; a generation unit configured to generate a target audio file by outputting the accompaniment audio file in the display time of the first lyrics part and outputting the original vocal audio file in the display time corresponding to the second lyrics part; and a sending unit configured to send the target audio file and a target lyrics file to a client upon reception of a karaoke request sent by the client, wherein the target lyrics file is obtained by adding the lyrics division information in the lyrics file.
 10. The karaoke processing device according to claim 9, wherein the processing unit is configured to determine paragraph information of the lyrics file based on the time information, and divide the lyrics file based on the paragraph information to obtain the first lyrics part and the second lyrics part.
 11. The karaoke processing device according to claim 10, further comprising: a balance processing unit configured to perform balance processing on the audio data of the target audio file at a switching point, wherein the switching point is the time point for switching from the display time corresponding to the first lyrics part to the display time corresponding to the second lyrics part, or the time point for switching from the display time corresponding to the second lyrics part to the display time corresponding to the first lyrics part.
 12. The karaoke processing device according to claim 9, wherein the processing unit is configured to determine a verse part and a refrain part of the lyrics file, and determine the verse part as the first lyrics part and the refrain part as the second lyrics part, or determine the refrain part as the first lyrics part and the verse part as the second lyrics part.
 13. The karaoke processing device according to claim 12, further comprising: a balance processing unit configured to perform balance processing on the audio data of the target audio file at a switching point, wherein the switching point is the time point for switching from the display time corresponding to the first lyrics part to the display time corresponding to the second lyrics part, or the time point for switching from the display time corresponding to the second lyrics part to the display time corresponding to the first lyrics part.
 14. The karaoke processing device according to claim 9, wherein the processing unit is configured to divide the lyrics file based on a custom template to obtain the first lyrics part and the second lyrics part.
 15. The karaoke processing device according to claim 14, further comprising: a balance processing unit configured to perform balance processing on the audio data of the target audio file at a switching point, wherein the switching point is the time point for switching from the display time corresponding to the first lyrics part to the display time corresponding to the second lyrics part, or the time point for switching from the display time corresponding to the second lyrics part to the display time corresponding to the first lyrics part.
 16. The karaoke processing device according to claim 9, further comprising: a balance processing unit configured to perform balance processing on the audio data of the target audio file at a switching point, wherein the switching point is the time point for switching from the display time corresponding to the first lyrics part to the display time corresponding to the second lyrics part, or the time point for switching from the display time corresponding to the second lyrics part to the display time corresponding to the first lyrics part.
 17. A karaoke processing system, comprising a server and a client, wherein the server comprises the karaoke processing device according to claim 9; and the client is configured to provide karaoke service to a user by using the target audio file as accompaniment audio and using the target lyrics file as lyrics to be played.
 18. The karaoke processing system according to claim 17, wherein the client is configured to play the target audio file and the target lyrics file, record audio data of the user in the display time corresponding to the first lyrics part based on the lyrics division information, and mix the audio data of the user and the target audio file to obtain an antiphonal singing file.
 19. The karaoke processing system according to claim 18, wherein the client is further configured to provide, based on the lyrics division information, a first identifier for the first lyrics part displayed on the client and a second identifier for the second lyrics part displayed on the client.